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In a conventional PAM data transmission system, the transmitted 
signal is x(t) = 2 a n g(t — nT), where \a n \ is a 2 L -level data sequence, 
and g(t) is a Nyquist pulse (g(0) ^ 0, g(mT) = 0,m j* 0). Ideally, the 
bandwidth of the pulse g(t) and, therefore, the bandwidth of x(t) can 
be made equal to 1/2T = p/2L, where p is the data rate. In practice, 
however, an "excess bandwidth" of at least 10 to 20 percent is re- 
quired. 

Using a class of real sequences called "discrete prolate spheroidal 
sequences," we show how to construct a modulated signal with band- 
width just slightly in excess of the optimal p/2L (say, by 2 to 4 percent). 
The new signal is similar in many ways to a conventional PAM signal, 
and in particular an ad-hoc receiver structure is suggested for which 
the resulting error performance is about the same as for a conventional 
PAM system operating in the same environment. 

I. INTRODUCTION 

To fix ideas, consider the following conventional (baseband) PAM 
data-transmission scheme (see, for example, Ref. 1). The data to be 
transmitted is a sequence la*)"*,. The a* are independent identically 
distributed copies of the random variable a, which is uniformly dis- 
tributed on the set {±1, ±3, • • • , ±2 L — l\. Thus, a takes 2 L equally likely 
values, where L = 1, 2, • • • , is a fixed parameter. The modulated signal 
is 

x (t)= £ a k g (t-kTo), (1) 

where go(-) is a real-valued "Nyquist pulse" — i.e., 

£o(0) * 0, 
go (kT ) = 0, k * 0. (2) 

Since (2) implies that x {kT ) = a k g (0), k = 0, ±1, ±2, • • • , the data 
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sequence \a k } can be obtained from x (-) simply by sampling. We assume 
that the Fourier transform 



G (/) = f°Jo(t)e-Wdt, 



-co </< 



(3) 



of g (-) has support on the interval [-F , ±F ], where F < 1/T . Under 
this assumption, the Nyquist condition (2) is known 1 to be equivalent 
to 



•('-#- 



G (f) + G (f-— ) = Tg (0), 



i o 



(4) 



(except perhaps in a set of measure zero). Figure 1 is an example of a real 
G (f) which satisfies (4). An often-used Nyquist pulse G (f) is the so- 
called raised-cosine pulse. (See Ref. 1, pp. 50-51.) The bandwidth of x (-), 
which is the same as the bandwidth of g o (0, is taken as F . The difference 
F - 1/2T is called the "excess bandwidth." 

To conserve bandwidth, it is desirable to make Fo as close to 1/2 To 
as possible, but typically (F - l/2To)/F > 10 to 20 percent in real 
systems. Further reduction in the excess bandwidth is difficult, since 
the very sharp cutoff filter used to generate go(t) with Fo close to 1/2 T 
will introduce either phase distortion or ripples in the amplitude char- 
acteristic. 

In a practical data transmission system for the voice-grade telephone 
channel, a reduction in bandwidth is also desirable, since the channel 
characteristics at the band edges are poor. 

In this paper, we suggest another approach to the signal design 
problem which will allow a further reduction in the excess bandwidth, 
perhaps to as little as 2 to 4 percent. The technique involves a family of 
sequences called "discrete prolate spheroidal sequences" (dpss) and 
is also intimately tied up with notions concerning the space of square 
summable sequences (/ 2 ). Therefore, before presenting our scheme, we 
must digress to review some notions about the space / 2 and to introduce 
the DPSSs. We do this in Sections II and III, respectively. In Section IV 
we discuss our new scheme. 



EXCESS BANDWIDTH 




1/2T F Q 1/T 

Fig. 1 — Example of real G({) satisfying eq. (4). 
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At this point, I would like to acknowledge with thanks three of my 
colleagues without whose help this project could never have gotten off 
the ground. D. Slepian introduced me to DPSSs, and with much kindness 
and no small amount of work helped me to get numerical values for these 
sequences and their associated eigenvalues. J. Mazo taught me most of 
what I know about data communication, and J. Salz's interest and en- 
thusiasm stimulated me to obtain a full understanding of the properties 
of the modulation scheme. 



II. REVIEW OF THE SPACE / 2 

The space I2 of square-summable, real-valued sequences is the set of 
sequences {a (n )}£-_«, (or a(-)) such that 

£ a 2 (n)< 00. (5) 

n = — "> 

Leta(-), b(-) e I2', then the inner product of a(-) and 6(-) is 

(a,b) = £ a(n)b(n). (6) 

n = — od 

Also, the norm of a(«) is 

||a|| = (a, a) 1 / 2 . (7) 

We will need the following facts. For a(-), 6(*), c(-) e h, and any real 
number 7, 

(ya, b + c) = y(a, b) + y(a, c), (8) 

which implies that, for ay e I2, j ' = 1, 2, • • • , 

llLayll 2 = £ ||a y -|| 2 +2 £ (a jt a k ). (9) 

J j j<k 

Further, the Schwarz inequality is, for a, b e I2, 

\(a,b)\ < ||a|| ||6||. (10) 

For a(-) e l>, the (sequence) Fourier transform is defined by 

A T (f)= £ a(n)e- i2 *f Tn , - ro </<co, (n) 

n= — a> 

where T > is a fixed parameter. Of course, Ar(f) is periodic with period 
1/T, and usually we will be concerned only with its values on the interval 
[— (1/27 1 ), (1/2T)]. The sequence \a{n)\ can be recovered from A-r(-) by 
the formula 

a(n) = T f ^ A T {f)e i2 *f Tn df, -co <„<«>. (12) 

J-1/2T 

The convolution theorem states that if 

c(n) = £ a(m)b(n - m) 

m = — f 
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(which we denote c = a*b), then 

C T (f) = A T (f)B T (f), — </ < co, (13) 

where A T , Bt, and Ct are the transforms of a, b, and c, respectively. 
The Parseval relation is, for a,b e l 2 , 

X+1/2T 
A T (f)BHf)df, (14) 

-1/2T 

where "*" denotes complex conjugate. Thus, in particular, 

/-.+1/2T 

||a|| 2 = <a,a> = T f \A T {f)\*df. (15) 

J-1/2T 

We say that a sequence a (•) e / 2 is bandlimited to [0, F], < F < 1/2T, 
if its transform A T (f) = 0, for F < \f\ < 1/2T. Thus, a bandlimited a(-) 
can be written 

a(n) = C F A(f)e i2 *f Tn df. (16) 

A sequence a(-) has support on the interval [iV 1( iV 2 ], — °° < Ni< N 2 
< <», if g (m) = 0, for n $ [Ni, N 2 ]. A sequence with support on 
[N h N 2 ], where \Ni\, \N 2 \ < m , cannot be bandlimited to [0, F] with 
F < 1/2T. 

It is convenient to define the bandlimiting operator on l 2 , % = Bf, 
0<F< 1/2T, by (for a e l 2 ) 

Ba = b, (17a) 

where 

b(n) = f F A T {f)e^f Tn df. (17b) 

In other words, the transform of b(-) is 

AAf), |/| SS (17c) 



Br(/) = 



0, ^l/l^ 



A sequence a e l 2 is bandlimited to [0, F] iff .$/** = a- Corresponding 
to the operator B F , we also define the complementary operator B' = 
!B F = I — jBf, where / is the identity operator. 

We also define the index- limiting (or time-limiting) operator T) = 
T> N (1 < N < co), by (for a e l 2 ) 

Da = b, (18a) 

where 

\a{n), l<n<N, 



(18b) 
0, otherwise. 



b{n) = 
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Thus a e l 2 has support on [1, N] iff S) N a = d. We will need the following 
easily established propositions. 

Proposition 1: Let x(t), — <*< t < °°, be a real-valued function with 
ordinary Fourier transform (as defined by (3)) X(f), — °° <f< °°. Let 
the sequence a(-) be defined by a(n) = x(nT). Then, the sequence 
Fourier transform of a(-) is 

In particular, ifx(t) is bandlimited to F Hz, and 1/T> 2F, then 
A T (f) = \,X(f), |/| si 

Thus the sequence a(«) is bandlimited to [0,F]. 

Proposition 2: Let a(-) e 1% and letg(t) be a real-valued function of 
the continuous variable t. Let 

x(t)= f; a(n)g(t-nT), -oo <£<«>. 

n= — cd 

Then the ordinary Fourier transform of x(t) is 

X(f) = A T (f)G(f), -«</<». 

where A T (f) is the sequence Fourier transform of a(-) and G(f) is the 
ordinary Fourier transform of g{t). 

III. DISCRETE PROLATE SPHEROIDAL SEQUENCES 

Let T, F > (where W = FT < 1/2) and N > 1 be given; let the oper- 
ators # = ftp, 2) = 3) N be as defined in Section II. The following theo- 
rem is proved in Appendix A. 

Theorem 3: There exists a set of real sequences \(f>j(-)\f=\, called "discrete 
prolate spheroidal sequences" (DPSS), with support on [1, N] and a 
corresponding set of real numbers \Xj\i, called "eigenvalues," with the 
following properties. 

(A) 1 > X, > X 2 • • • > \jv > 0, and £ A, = 2FTN. 

7=1 

(B) c A!B(t)j - \j4 Jt 1 <j <N. 

(C) (<f>j, M = b jk 

(D) (%<t>j, $<j> k ) = \j8 jk 

(E) With 8>0, and F, T held fixed, as N ^ <», 

1^ | number of j such that} 
N{ 5 < Xj < 1 - 8 

(F) With e > 0, and F, T held fixed, as N -* °°, 

^2F7W(l-«) ""* 1» 
X2FTN(l + t) ~* 0. 
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0. 



(G) (Slepian [Ref. 4, eq. {63)]), with t > and F, T held fixed, as 

1 - X2FTN(i-«) = exp!-C( € )iV + o(N)\, 

where C(t) > 0. 

(H) The <t>j{-) and \j, 1 < j < N, depend on F, T only through their 
product W = FT. 

Remarks: 

(i) In the course of giving the proof of Theorem 3, we will show ex- 
plicitly how to find the DPSSs {<£/} and the corresponding \Xj\. 

Hi) Theorem 3A, F implies that, with N large, about 2FTN of the X/s 
are about 1, and that the remainder (about (1 - 2FT)N) of the X/8 are 
about 0. Theorem 3G indicates that the convergence as N -*- °° is quite 
rapid. Since this fact is crucial to our modulation scheme, we list some 
of the X/B for FT = V 4 , and various values of N in Table I. Here 2FTN 
= N/2, so that about half of the X,s are 1 and the remainder are about 
0. 



Table I — |X/| for W= 0.25, for N = 5, 10, 20, 50, 100 





i 


x* 


;' 


A, 


N = 5 


1 


0.9976686 


15 


0.0000212 




2 


0.9244132 


1G 


0.0000008 




;s 


0.5000000 


17 


0.0000000 




4 


0.0755868 


18 


0.0000000 




5 


0.0023143 


19 
20 


0.0000000 
0.0000000 


N = 10 


1 


0.9999994 








2 


0.9999490 N = 


: 50 1-21 


>0.9997 




3 


0.9980787 


22 


0.998 




4 


0.9650286 


2:1 


0.985 




5 


0.7326630 


21 


0.914 




6 


0.2673371 


25 


0.680 




7 


0.0349714 


26 


0.320 




8 


0.0019213 


27 


0.086 




9 


0.0000510 


28 


0.015 




10 


0.0000005 


29 
30-50 


0.002 
<0.00023 


N = 20 


l 


0.9999999 








2 


0.9999999 N = 


. 100 1-45 


>0.9998 




:( 


0.9999999 


46 


0.9993 




4 


0.9999999 


17 


0.996 




5 


0.9999992 


48 


0.976 




(i 


0.9999788 


49 


0.892 




7 


0.9995798 


50 


0.664 




8 


0.9940340 


51 


0.336 




9 


0.9435514 


52 


0.108 




10 


0.7070557 


51! 


0.024 




11 


0.2929445 


54 


0.004 




12 


0.0564486 


55 


0.0007 




13 


0.0059659 


56-100 


<0.0001 




14 


0.0004201 
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(Hi) Theorem 3C implies that ||</>,|| 2 = 1, and Theorem 3D implies 
that \\ l B(t>j\\ 2 = Xj. Thus, the fraction of the energy of </>; within the band 
[0, F] is Xj. Therefore, when Xj is close to unity, ; is a sequence with 
support on [1, N] with most of its energy in the band [0, F]. Theorem 3 
implies that, with N large, there are about 2FTN orthogonal sequences, 
i.e., fy ■. (j = 1, 2, • • • , 2FTN(l - e)), with support on [1, N] which are 
approximately bandlimited to [0, F]. 

(iu) Slepian has made an exceptionally detailed study of DPSSs and 
their properties. Reference 4 contains most of his results, and Ref. 5 
describes a Fortran program for computing the DPSSs and their eigen- 
values. 

IV. HEURISTIC DESCRIPTION OF THE MODULATION SCHEME 

Let the data to be transmitted be as in Section I, the 2 L - level sequence 
{«;}"„. We break this sequence into blocks of length i>, where the /eth 
block is «*„+!,■■ ■, tff/e+i),,, — co </j < CDj an d where v is an integer to be 
chosen later. Consider the 0th block a\, • • • , a„. Let N > v be another 
integer parameter, and let F, T > 0, with FT < % be given. Let 
0y, Ay, 1 < j < N, be the DPSSs and eigenvalues guaranteed by Theorem 
3, with parameters N, F, T. Then define the sequence 

/< 
a(n) = £ aj<f>j(n), -« < n < °°. (19) 

j=\ 
Observe that a(-), like the ^ (•), has support on the interval [1, N]. Fur- 
ther, if we take v = 2FTN(l - t), with N sufficiently large so that X„ « 
1, then from Theorem 3 (see remark iii), a(-) is approximately band- 
limited to [0, F]. 

Now the modulated waveform corresponding to the 0th data block 
is 

N 

xo(t) = Z a(n)g(t - nT), -°o <£<«,, (20) 

n=l 

where the pulse g(t) has Fourier transform G(f) which satisfies 

(21a) 




(21b) 

Thus, we do not specify G(f) in the interval [F, 1/2T], except by (21b). 
Since G(f) need not have sharp transitions, it is not difficult to imple- 
ment in practice. For the 6th data block (-«> < k < <*>), a kl , + x , • • • , 
a(k+\),, we set 

a{n) = £ a kl , + j<j>j(n - Nk), Nk + 1 < n < N(k + 1), (22) 

7=1 
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and let the modulated waveform be 



N{k + 1) . . 

x k (t)= L a(n)g(t-nT). (23) 

n=Nk+l 

The entire modulated signal is 

x{t)= £ x fe (t)= £ a(n)g(t-nT). (24) 

Since the number of bits in each data block is L» and each data block 
"occupies" NT seconds, the transmission rate is 

p-(£)fbiWs. (25) 

We now give an intuitive, though imprecise, explanation of the 
properties of the modulation scheme. Consider x (t) given by (20). Its 
Fourier transform is, from Proposition 2, 

X (f) = A T (f)G(f), (26a) 

where 

A T (f) = L a(n)e-n*f Tn 

n=l 

= t aj*jT(f), (26b) 

7=1 

where $/t is the sequence Fourier transform of 0/(-). In the light of 
remark iii following Theorem 3, the \$jAf)\j-i and therefore A T (f) are 
approximately zero for |/| e [F, 1/2T] provided v < 2FN(l - c). Since 
G(f) is bounded in this interval and for |/| > 1/2T, we see that X (/) 
is approximately bandlimited to \f\ < F. Further, if we take v = 
2FTN(1 - «), we have from (25) that the transmission rate p is 2FL(1 
- e). Thus in our scheme we can transmit 2F{1 - e), 2 L - level data sym- 
bols per second with bandwidth F. If e = 0, then we would have effec- 
tively constructed a PAM system with no excess bandwidth. Since, in 
practice, e can be made very small, we can in fact come quite close to the 
ideal. 

So far so good. But we still must show that the data symbols {a;}; can 
be recovered conveniently from x {t). In fact, we claim that the samples 
x (nT) w a(n), 1 < n < N. The key observation here is that, since 
A T (f) * 0, |/| e [F, 1/2T], then X (/) is not appreciably changed when 
G(f) is replaced by G/(/) ("/" for "ideal") where 

_ T, |/| < 1/2T, 
Glif) ~ 0, |/| > 1/2T. (2?) 

The inverse transform of G/ isg/(£) = (sin irt/T)/(irt/T). Let us therefore 
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define x/(£) by replacingg(0 by gj(t) in the definition of x (t). We ob- 
tain 

xi(t)= £ a(n)gj(t - nT), 

n=l 

so that 

xi(nT)=a(n), 1 < n < N. 
It follows that 

x (nT) - a(n) = x (nT) - xjinT), 1 < n < N. 
Now define the sequence c(-) by 

c(rc) = jc (/iT) - x^nT), 1 < n < N. 

Since jc (t) - */(£) is bandlimited to 1/2T Hz, we have from Proposi- 
tion 1 that 

CAf) = \[Xoif)-X l {f)l \f\*^, 
and from Proposition 2 that 

Crif) = \ IXoif) ~ */(/)] 
= J,A T (f)[G(f)-G I (f)]. 
From the Parseval relation (15), 
E [x (nT)-o(n)]2< ||c||* 

n=l 

X1/2T 
, /2T lc r( /)|w 

JF <( < 1/2T 

The inequality follows from (21b), which implies that 

\G(f)-GAf)\ <2T. 

Thus, a(n), 1 < n < N can be recovered from x (t) simply by sam- 
pling. From (19) and the orthonormality of the \<t>j\'[, 

olj = L a(n)<j>j(n), (28) 

n = l 

PAM DATA TRANSMISSION 3285 






so that the {o/jj can be recovered from samples x{nT), 1 < n < N. 

Aside from imprecision, the above arguments completely ignored the 
effects of the other data blocks (k ^ 0) and the effects of channel dis- 
tortion and noise. In the next section, we give a precise definition of the 
modulation scheme and of a proposed receiver, and then state theorems 
that give bounds on the error introduced by linear channel distortion 
and a channel noise. We will also bound the instantaneous power 
x 2 (t). 

V. PRECISE STATEMENT OF RESULTS AND DISCUSSION 

Let {«/}"«, be, as in Section IV, a sequence of independent, identically 
distributed copies of the 2 '-valued random variable a, where 

Pr{« = m\ = 2~ L , m = ±1, ±3, • • • , ± (2 L - 1). (29) 

The sequence {«;) is the data sequence to be transmitted. Let v, N, F, T 
> be parameters such that i>, N are integers, and 

v < N, (30a) 

W = FT < 1/2. (30b) 

Partition the data sequence into blocks of length v , such that the &th 
block is 

k = 0, ±1, ±2, • • • . Let \<t>j(-), \j\jLi be the quantities (DPSSs and eigen- 
values) whose existence is guaranteed by Theorem 3 with parameters 
N, W. Corresponding to the /zth data block, define the sequence a k (-) 
by 

a k {n) = t (* v k+j<l>j(n - Nk), (31a) 

and let 

a(-)= t «*(•). (31b) 

Since </>,(•) has support on [1, N], a k (-) has support on [Nk + 1, 
N(k + 1)]. Finally, the modulated signal is 



where 



x(t) = L x k (t), — < t < oo, (32a) 



x k (t)= L a k (n)g(t - nT), 

n = — m 

= E a k (n)g(t - nT), (32b) 

n=Nk+l 
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and the pulse g(t) is the inverse Fourier transform of G(f), which we 
leave unspecified for now. 

A block diagram for the modulator described above is given in Fig. 2. 
The data symbols appear at a rate of v/NT per second. Box A takes the 
data symbols /' at a time and calculates the numbers \a(n)) — producing 
N outputs for every v inputs. Thus, the a(n) appear at a rate of 1/T per 
second. Box B produces x(t) by modulating the amplitude of a pulse 
train with the \a(n)\. Although we will not specify the ratio v/N and the 
pulse g(t) now, it will be useful to informally think of 

v/N = 2FTN(l - e) = 2WN(1 - t), 

and g{t) ** G{f) as in (21). We will allow the possibility of non-physically 
realizable pulses g(t), with the usual understanding that a close ap- 
proximation to g(t) can be obtained with a finite delay (which we shall 
ignore). 

The received signal is taken as 



y(t) = w(t) + z{t), 



where 



w(t) = 



-S- 



x( T )h c (t - r)dr, 



(33a) 



(33b) 



and where h c (t) is the impulse response of the channel (H c (f), the 
channel transfer function, is the transform of h c (t)), and z(t) is noise 
with zero mean and power spectral density Nz(f)- 

We now turn to the receiver. We will postulate a simple receiver 
structure which, though not optimum, has the virtues of simplicity and 
amenability to analysis. Furthermore it is probably not very far from 
being optimal itself. Refer to Fig. 3. The received waveform y(t) is first 
sampled at t = nT, to produce the sequence \y (nT)\„ = -«,. These samples 
are the input to box C, a tapped delay line with 2M + 1 taps. The output 
of box C is the sequence [d(n)\ given by 



|aj| 


A 


(a(n)} 


B 


x(t) = 2 a ' n >9 (t - nT > 









Fig. 2— The modulator. 



a(n)= 2 c{m)y |(n-m)T) 



y(t)=w(t) + z(t) 



y(nT) 



1/T 



TAPPED 
DELAY LINE 



*JL 



CORRELATOR 

Fig. 3 — The receiver. 
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+M 

d(n)= L c(m)y((n - m)T), -« <n<c°, (34) 

m=—M 

where {c(m)|tj|5 are the tap weight coefficients. As we shall see, the tap 
weights should be chosen so that the sequence &{•) is the receiver's best 
guess of the sequence a(-). 

Consider the Oth block of \d(n)\, i.e., d(l), • • • , d(N). If d(n) = a(n), 
then the Oth block of data symbols «!,•••,«„ could be recovered from 
{d(n)\ using (28). Our approach will be to use these same formulas to 
obtain an estimate {«,) of the jo,}, i.e., 

otj = E d(n)<f>j(n), l<j<v. (35a) 

For the remaining blocks (k ^ 0), we proceed analogously, viz., 

«*„+; = L d(n)</>,(n - iV/z), 1 < ; < v. (35b) 

n=Nk+l 

This is box D. The final step in the demodulation process is a "sheer," 
which examines a/(— °° < j < °°) and emits a,- where a/ equals a value 
of m e {±1, ±3, • • • , ±2 L — 1} which minimizes \aj — m\. 

As in (25), the transmission rate is p = (v/N) • {LIT) bits/s. 

We are now ready to state the properties of the modulation scheme 
in the form of theorems. The proofs of these theorems are given in Sec- 
tion VI. Theorem 4 gives an upper bound on the average power of the 
transmitted signal x (t). Theorem 5 gives an upper bound on the expected 
instantaneous power E x 2 (t), as a function of t. Finally, Theorem 6 gives 
an upper bound on the mean-squared error. We state these results with 
no restrictions on G(f), H c (f), and Nz(f)- In the remarks which follow 
the statement of the theorems, we will look at some interesting special 
cases. 

We begin with a bit of notation. Denote the variance of the data ran- 
dom variable a, defined in (29), by 

al = Ea 2 = 2~ L £ m 2 = ^ L ~ l) ^ L + l \ (36) 

|m.|<2 L - 1 3 

m odd 

Also, for v,N, W = FT satisfying (30), let 

Q = Q(u,N, W)=-± (1-Ai). (37) 

v j=i 

Of course, if we set v = 2WN(1 - e), and let N — • «, with W, € > held 
fixed, then Q -* 0. It will be helpful to think of Q as a small quantity. 
Here are the theorems. Although they may seem formidable at first 
glance, please stick with it! In the extensive discussion following the 
theorem statements, you will see that they can be easily applied and yield 
useful information. 
Theorem 4: (average power) 
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l - r NT 



P AV = E ( x 2 (t)dt 

NT Jo 

Ax= sup ^ f; |g(/-^)| 2 . (38b) 

F<|/|<1/2T 1 z fe=-~ I \ i/l 

Theorem 5: For — ro < £i < °°, 

1/2T 



ExHt ^^Lnr k k G ( f -T) e ' 2 "" 



df. (39) 



Theorem 6: {mean-squared error) 

e^-E± (aj-aj)* = € 2 N +el (40) 

V 7=1 

(N stands for noise, and I for intersymbol interference). The noise error 
€% is bounded by 

^-^ f-F |Cr(/)|2 (*-?- Nz ( /_ r)) df + MQ ' {41a) 

M 

C T (f)~ E c{n)e- i2 *f Tn , (41b) 

Nzif) is the power spectral density of the noise, and 

F<\f\<l/2T ■/ *=-» \ 2/ 

The intersymbol interference error ef is bounded by 

ej < al (^j I" T £* | C T (f)B T (f) - l\*df + A 3 q], (42a) 

where 

Br(/) = ^£_G(/-|)H C (/-|), (42b) 

Cr(/) is giuen fey (4i6), and 

A 3 ^ sup |C r (/)B T (/)-l|2. (42c) 

F<|/|<1/2T 

l?emar&s: 

(i) The reason that P^v as defined by (38a) is the "average power" 
is that the random function x(t) is cyclostationary with period NT. In 
other words, the shifted sequence x(t) — x{t — kNT) has the same sta- 
tistical properties as x(t) itself (for k = 0, ±1, ±2, • • •)• Thus, it follows 
that 

1 /»t/2 1 r»NT 

lim£- ( xHt)dt = -—E \ x 2 (t)dt = P AV . (43) 

r-^co T J-r/2 NT Jo 
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(ii) When Q is small, the upper bound of (38a) on P^y depends es- 
sentially on the folded power spectrum 

Furthermore, if G(f) = for |/| < 1/2 T, then 

k 



IG(/--)=G(/), i/i <f, 



so that Theorem 4 becomes 



P ^-^[S-F^ G{f)l2df + N AlQ ]' (44a) 



where 



Ai- sup |G(/)|2. 

F<|/|<1/2T 



(44b) 



(m) If we assume, as in Remark ii, that G(/) = 0, |/| > 1/2T, then 
Theorem 5 becomes 

a 2 r X ' 2T 

ExHt)<-£ j \G(f)\ 2 df. (45) 

T J-1/2T 

Thus the upper bound on E x 2 (t) depends on G(f) for |/| < 1/2T. 

(iv) Saltzberg's 3 bound can be applied (see Appendix B) to our 
problem to show that the distribution for the instantaneous power 
satisfies 



Pr{|x(t)| 2 >r 2 }<2exp 



r- 



2Ex 2 (t) 

The bound of Theorem 5 can be applied here to further overbound this 
probability. 

(u) We now explain the rationale for using the mean-squared error 
e 2 = (1M 2y' =1 E(otj - aj) 2 . First note that with e 2 = £(«,- - ay) 2 , 
Saltzberg's bound (see Appendix B) can again be used to show that, if 
the noise is Gaussian, then 

P ej = Pr{«; ^ aj) < 2 exp (- -J. 

Since the sequence of random pairs {«y, ay}yL_„ is cyclostationary with 
period v , the overall error probability is 

1 



1 " 2 " 

Pe = - L Pej * ~ E exp | 

j/y=l v j=\ 

Now let e^ax = max ej . Ineq. (46) yields 



2e? 



(46) 
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P e <2exp -— . 

Thus it would appear that €^ax rather than e 2 is the appropriate criterion. 
Nevertheless the use of e 2 (which was, of course, chosen for its mathe- 
matical tractability) can be justified by the following argument. 

Let R be the vX v covariance matrix with (J, fc)th entry E(c\j — a/)« 
(a/j — oik), 1 — j> k — "■ Let Mo be a v X // orthogonal matrix such that 
the diagonal elements of MqRMq are all identical.* One choice of Mq is 
Mo — M1M2, where Mi is a i» X v orthogonal matrix which diagonalizes 
R (i.e., M[RM\ is a diagonal matrix), and Mi is a normalized Hadamard 
matrix (i.e., a j> X // orthogonal matrix with entries il/Vi/). A normalized 
Hadamard matrix is known to exist for all v which are multiples of 4 
up to 200. 

Now modify the communication system as follows. Preceding box A 
in Fig. 2, insert a device which multiplies the data symbols [otj\, taken 
in blocks of v, by the matrix Mq. Then, following box D in Fig. 3, insert 
a device which multiplies the input \c\j\, taken in corresponding blocks 
of length v, by Mq" 1 = M . Let the output of this device be \a'j\. It is easy 
to show that (i) the analysis which yields Theorems 4 to 6 is unchanged 
in the modified system and (ii) E(a'j — ocj) 2 = e 2 , 1 < j < v. 

We must emphasize that the choice of the matrix Mo depends on R 
which in turn depends on the channel which is usually unknown or 
variable. Although it is undoubtedly possible to find an adaptive pro- 
cedure for finding a good matrix Mo, our conjecture is that, in most real 
situations, M can be chosen to be any normalized Hadamard matrix 
with fairly good results. 

(vi) To obtain more insight into our scheme, let us assume that 
Q m 0, and G(f) = H c {f) = N(f) = 0, |/| > 1/2T. Then (38), (41), (42) 
become 



PAV - T j-F |G(/)|2d/ ' (47a) 

€%<- C F \C T {f)\ 2 N(f)df, (47b) 

v J-F 

ef < a\T ^ f F F \^C T (f)G(f)H c (f) ~ 



2 df. (47c) 



We see immediately that our bounds on the important quantities Pa v, 
e 2 N , ef depend on G(f), H c (f), N(f) only for |/| < F and not for F < \f\ 
< 1/2T. In particular, we need a channel whose bandwidth is F. Since 
the transmission rate 

NT N 2FT 2WN 



* Witsenhausen (Ref. 6) has shown that there always exists an Afo with the desired 
property for all v = 1, 2, • • • . 
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we have that 



F =£)m- 



(48) 



Now the ideal bandwidth in a conventional 2 L - level PAM system with 
rate p is (p/2L). We pointed out in Section I that the required channel 
bandwidth in real systems is usually no less than 10 to 20 percent in ex- 
cess of this. For our system with v = 2 WN(l - e), the ratio of the required 
bandwidth given by (48) to p/2L is (1 - e) _1 , which can be made very 
small. See the numerical example in remark vii. 

(uii) Roughly speaking, Theorem 6 tells us that there are three sources 
of error. The first is given by the integral in (41a) which is a bound on 
the error introduced by the noise in the band [-F, F]. The second is given 
by the integral in (42a), which is a bound on the error introduced by the 
imperfections of the channel, as compensated by Cr(f), in the band 
[-F, F] . The third source of error is the fact that Q > 0. The first two of 
these sources appear in conventional PAM systems, but the last is unique 
to our system. The following numerical example shows that Q can in fact 
be made small. 

Let W = 2FT = 0.415, N = 80, v = 64. Then Q(u, N, W) = (1/*) 2) =1 
(1 - \j) = i.oi X 10" 4 , which corresponds to -40.0 dB. The ratio 2 WN/u 
= 1.0375, so that the required bandwidth is 3.7 percent in excess of the 
ideal p/2L. 

Continuing with this example, let us say that 1/T = 6 X 10 3 /s, F = W/T 
= 2490 Hz, and L = 2. Then the transmission rate p = (i>/N) • (L/T) = 
9.6 kb/s. Note that 1/2T = 3 kHz, so that the system performance is es- 
sentially independent of the channel characteristics or noise in the band 
[2.49 kHz, 3 kHz). Of course, we are assuming that H c (f) = N(f) = 0, \f\ 
> 3 kHz. 

Finally, observe that the receiver-correlator (box D in Fig. 3) must 
perform N • v multiplications every N • T second, or u/T multiplications 
per second. For v = 64, and 1/T = 6 X 10 3 , this works out to one multi- 
plication every 2.6 fis. To store the N X u <t>j(n), 1 < n < N, 1 < ;* < v, 
to say 10-bit accuracy, we need a ROM with capacity 10 • N • v = 
51.2 kb. 

(viii) Continuing with the assumptions made in remark vi, let us fur- 
ther assume that G(f) = T, \f\ < F, and H c (f) = 1, |/| < F, i.e., a perfect 
channel response (in band). Also, let N(f) = N /2, \f\ < 1/2T. Then, 
from (47), 

P AV <2FTol 

Further, the upper bound on the total mean-squared error is minimized 
for C T (f) = 1 (i.e., c(0) = 1, c(m) = 0, m ^ 0). Then 

N r F Nn N 
v J-F 2 v 
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If u/N ~ 2FT, the total mean-squared error 

**&** (49) 

Note that (NqF) is the noise power in the band [-F, F] . Observe that in 
a conventional PAM system with a perfect Nyquist equivalent channel 
and additive white noise, the mean-squared error is given the right 
member of (49) with F = the Nyquist bandwidth (see Ref. 1, Chap. 
5). 

(ix) Suppose that it is desired to transmit our data using a modulated 
signal x(t) which is bandpass in the band [Fi, F2]. Then, using quadra- 
ture amplitude modulation (QAM) in a straightforward manner, we can 
modify the present scheme to achieve a bandpass signal. We will now 
outline the procedure. 

Let < Fi < F 2 be given. Set F=(F 2 - F{)/2, and choose T < 1/2 F. 
With F, T so chosen, form two modulated signals (with independent 
data) according to our (baseband) prescription. Denote these baseband 
signals by x (1, (0, x (2) (0- Their rates are each vL/NT. Then form a 
bandpass signal 

x(t)= xW(t) cos 2irF c t + xW(t) sin 2irF c t, 

where F c = (F 2 + Fi)/2. The signal x(t) is essentially bandpass with 
lower frequency 

and upper frequency F c + F = F 2 . The transmission rate for x('t) is 
2i>L „ v L __ / 1>\ L 



p = t^;=2 



«"-(£) *<*-*>■ 



NT N 2FT 
Thus the required channel bandwidth to pass x(t) is 

p /2WN\ 



"•-*>-&<*r> 



exactly as in the baseband case (48). It is a fairly simple matter to analyze 
the QAM system and obtain results analogous to Theorems 4 to 6. 

Another way of accomplishing the synthesis of a bandpass signal is 
to use "bandpass" DPSSs instead of the conventional DPSS characterized 
in Section III. 

(x) Combining (31) and (32), we can rewrite the modulated signal 
as 

x(t)= £ ajg(tj), 

where for kv <j < (k + IK 

Nik + 1) 

g(t. J) = £ 4>j-kAn - Nk)g(t - nT). 

n=Nk+l 
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Note that, for s = 0, ±1, ±2, • • • , 

g(t,j + sv)=g{t + sNT,j), 

so that there are only v possible shapes for g(t,j). 

We conclude from this that the present system is a kind of PAM, with 
the data jay) modulating the amplitude pulses \g(t, j)\. 

(xi) Computation of the Tap Weights: Let us again make the as- 
sumptions of remark ui. Then, to minimize e 2 , it is a reasonable strategy 
to choose the coefficients {c(m)}*f M so that C T (f)G(f)H c (f) is as close as 
possible to unity for |/| < F. Of course, we must take care not to enhance 
the noise by making C T (f) too large. In fact, it is a simple matter to solve 
for the optimal set \c(m)\^ M which minimizes our bound on the total 
mean-squared error (with Q « 0) 

+ rf f'i CrOT 2!flfiJfi.. I |V|. ,50, 



J-F I T \ J 



Let the sequences £ (-), £i(-), &(•) be the inverse transforms of 

Mxm «r(fl, r V >, 

respectively, where 

r(/) (i, l/l^. 

(o, f< 1/1*55;. 

Then the bound of (50) is, from the Parseval relation (15) ("*" indicates 
convolution), 

= -[l|c^oll 2 +4l|c*ei-^ll 2 ] 
V 



N 

V n = - 

+ 



co f r +m "12 

E E c(m')Z (n-m')\ 

= -oo lLm'=-M J 

°l\ £ c^'^i^-^)-^")] 2 ). 

Lm'=-M J J 

Differentiating with respect to c(m), -M < m <M, and setting the re- 
sult equal to zero, yields 

M 

E c ( m ') E £o(rc - "i)£o(" - m') 

m'=—M n= — m 

+ *« E ^i(n-m)^ 1 (n-m / ))= £ £i(" - m)^(n), 

n =-co J M=-co 
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or 

R 1 

m = 0,±l,...,±M, (51a) 



M 

L c(m)n (m - m') = n^irn) 

m=—M 



where 

CO 

Mo(w) = L [£o(n)£o(rc - m) + <r&i(n)£i(ra - m)] (51b) 

n = — co 

Mi(ro) = £ ^(n - fn)b(n). (51c) 

n= — oo 

Clearly, /x (.) is the inverse transform of 

and mi(-) is the inverse transform of (o*/T) G*(f)H' c (f)T(f). The tap 
weights \c(m)\^ M are found by solving the linear equations (51a). 

Of course, the above computation of the tap weight coefficients is 
possible only when the channel transfer function H c (f) and the noise 
spectrum N(f) are known. In most real applications, these quantities 
are unknown or changing, so that an adaptive learning technique is re- 
quired. 



VI. PROOF OF THEOREMS 

Proof of Theorem 4: Using (32), we have 
1 /*NT 

P * vA Nf B So xHt)dt 

i „ r NT / = \ 2 

= ¥r E i U. Xk{t) ) dt 

(1) 1 » r N T 

te ™ L E ( xi(t)dt 

NI fe=_cc Jo 

(2) 1 ro r -kNT+NT 

- ™ £ ^ ( xg(t)dt 

7VT* « J-kNT 

= NT E f-~ xUt)dt = ivT E X" |Xo(/)|2 ^ (52) 

Step (1) follows from the independence of the jay)"., which implies [see 
(31) and (32b)] that Ex* (*)**.(*) = 0, k ^ k'. Step (2) follows from (32b), 
and the fact that |a fc (n)}^JfcV+i has the same statistics as (a (n)}JLi. Thus, 
Xfc (t ) has the same statistics as x (i - WVT). 
We next apply Proposition 2, which implies that 

X (f) = A T (f)G(f), (53a) 

where 

A T (f)= L aWe-WTn. (53b) 

PAM DATA TRANSMISSION 3295 



Substituting (53a) into (52), we obtain 

Pav = ^E f_jA T (f)\ 2 \G(f)M 

=^j^xr i)/T|G(/)|2| ^ (/)|2d/ 

where we have used the fact that A T (f) is periodic with period 1/T. 
Thus 

=£rr(iJ G K)i>'^ w 

where the last step follows from the fact that the integrand is periodic 
with period 1/T, so that we can change the interval of integration from 
[0, 1/T] to [- 1/2T, 1/2T]. Continuing, we have 

+ ^ £ L,^?l G K)l> (/)|2d/ 

= h + h ■ < 54 > 

Now the second integral / 2 can be overbounded by 

h<^A x n)E f \A T (f)\ 2 df = ^A 1 TE±\\!B'ao\\ 2 , 

NT jF<\f\<i/2T l N T 

where A x is defined by (38b). From (31), with k = 0, and (B'<t>j,%'<j> r ) 
= (1 — A,)fyy, we have 



h^A.E 









(55) 



where Q is defined by (37). 
To overbound Ii, the first term in (54), we again use (31) to obtain 



E\A T (f)\ 2 = E 



L aj*j T (f) 
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where $ jT (f) is the transform of 0/(.). Since E ctjcty = olb jy , we have 

E\A T (f)\*=± <F*\* jT (f)\* 

<<rlN, 
by Theorem 7 (proved in Appendix C). Thus 



(56) 



Substituting (55) and (56) into (54) yields Theorem 4. 

Proof of Theorem 5: Let t t , -» < *i < «, be given. Then from (32) and 
(31), 

N(fe+1) ,- 

*(*i) = E Xfc(ti) = E E E <xk>.+j<t>j(n - Nk^iti - nT) 

fc«-« ft=-a> n =Nk+l 7=1 

E fyin-NWgit.-nT) \. 

n = Nk+l J 

Using £ a/ay = o- 2 .^//, we obtain 

<• r N(fc+i) T2 

£* 2 Ui) = E E^ E tjin-NVgi^-nT) . (57) 

ft=-a> /«i |_/i=N/e+l J 

Now with ti held fixed, define the sequence c(-) by 
c(n)=g(t 1 - nT), -oo<n<oo. 
Also for — » < A < », 1 < j < v, define the sequences 0*y(») by 

<f>kj(n) = <t>j(n - kN), —» < n < °°. 
Thus 0ty(«) has support in the interval [Nk + l,N(k + 1)]. Of course, for 

-oo <k, k' < co, 1 <;,;' < „, 

(<t>kj> (fik'j' ) = 

0, otherwise, 
so that \(t>kj\k.j is a family of orthonormal sequences. Furthermore, the 
term in brackets in (57) is (<f> kj , c), so that (58) can be written 



ExHt 1 ) = a* E E <0A;,c> 2 . (58) 

Letting <£ be the subspace of l 2 spanned by the \<p kj \, and P#c the pro- 
jection of the sequence c into c£, (58) is 

Ex(t 1 ) = a 2 JP#c\\*< ( T 2 Jcp. (59) 

We will now bound ||c|| 2 . 

Define the function Wi(t), —<*> < t < oo, by 

w\(t)=g(t l -t), 
with ti still held fixed. Then 
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c(n) = Wi(nT), 
and Proposition 1 yields 

where W x {f) is the ordinary Fourier transform of w(t). Since Wi(f) = 
G*(/)e -«*■/*!• we have, from (15), 

/*1/2T 

= T \ \C T (f)\ 2 df 

J-1/2T 



12 = 



-;x:;;ii. o -('-r) 



*\ e -i2irftx 



"df. 



Combining this with (59) yields Theorem 5. 

Proof of Theorem 6: We begin by observing that the entire system 
described in Section V (up to the sheer in Fig. 3) is linear and the noise 
is additive and independent of the data. Thus, the error sequence 
\&j - ctj\Z a can be written as the sum of two sequences (j8,}"» and (?/}"». 
The sequence |/S/) is data dependent and is of the form 

/' 
In fact, the sequence |/3/j is the output of box D in Fig. 3 when the noise 
2(0 ■ 0. The sequence J-yy] is due to the noise and is, in fact, equal to the 
output of box D in Fig. 3 when we set w(t) = 0. Since the data and noise 
are uncorrelated, so are \fij) and \yj\. Thus the mean-squared error 

e2 = -£E (aj-aj) 2 
= -£Z(fr + 7,) 2 

1 v / " 

v y=i f y=i 

^6/ 2 +6^. (60) 

We will overbound ej and e^/ separately. 

We begin with e%, the error due to the noise. Thus we must over- 
bound 

4i = E-±y}, 

Uj=l 

where \yj\ is the output sequence of box D in Fig. 3 when w(t) ■ 0. Let 
us define the sequence b (-) to be the output of box C when w(t) = 0. 
Then 

Tj = (bo, 4>j)= E 6 o (n)0y(n), l<j <N 

n = l 
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and 

* = ~E t 7/ = " t £ E <t>j(n)4>j(m)E b (n)b (m). (61) 
" /-1 f 7 =/ n =i m =i 

Next observe that &o(-) is a stationary random sequence with 
E b (n) = 0,E b {n)b {m) = R ho (n - m). The sequence Rb Q (n), - « < 
n < °°, is the inverse Fourier transform of its spectral density S bo (f), 
which is, from Proposition 1, 

^o(/) = ^|Cr(/)| 2 ft E m iV z (/-|). (62) 

Returning to (61), we write 

1 1. N r n -1 

ew = ~ £ £ 0y(n) £ #b () (" " m)<f>j(m) \. (63) 

Cy'ln-l Lm=l J 

The quantity in brackets in (63) is dj(n), where the sequence dy(-) is the 
convolution of the sequences R bo {-) and 0y(-). Further, (63) can be written 
as 

INN IN 

4i - - £ L 7 (M)d 7 (n) = - E <0y, dy>. 
From the Parseval relation (14), we have 

T N /»1/2T 

; : 

where <£,r(/)> Djr(f) ar e the transforms of 0y(-), dy(-), respectively. 
Furthermore, the convolution theorem (13) yields 

DjTif) = *j T (f)S bo (f), 

so that (64) becomes 

• 1/2T 



* 2 n = -£ ( D jT {fW jT {f)df, (64) 

/> /=! J-1/2T 



+ -i f s b0 (/)i<iv(/)|2d/. 

Using Theorem 7 (Appendix C), we obtain 

<- C F {TS b0 (f))df+ \ sup S bQ (f)] ~ t W<t>j\\ 2 
N r F , 1 k\ l " 

= - ( \CT(f)\ 2 ZN z (f--)df+A 2 -za-\j), 

V J-F k \ 77 V ,= 1 J 



which is (41). 
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It remains to verify (42), which is an upper bound on the data- 
dependent error or intersymbol interference error ef. Thus, set the noise 
z(£) = 0, and 

ej = E-± (aj-ctj)*. (65) 

vj=i 

We begin by observing that, since the sequences <£,(•), ;' = 1, • • • , f , are 
orthonormal, any sequence c (-) can be written as 

co(-) = t yj+j(-) + r« , (66) 

where <r, 4>j > = and 7; « <Co, 0y>, 1 <; '^ r. Applying (66) to the se- 
quence Did - a), where d(-) and a(-) are as defined in Section V and 
e D = 3) N is the index-limiting operator defined in Section II, we ob- 
tain 

3)(d-a) = t (2)(a-a),<t>j)<t>j + r{-) 

= t (a -a, <t>j)<fij + r, 
where (r, ty ) = 0, 1 < j < v. Thus 

||£(d-a)|| 2 = t (d-a,<f>j) 2 + ||r|| 2 



Now from (31), 

and from (35a), 
Thus (67) is 



>t(d-a,<t>j) 2 . (67) 

7=1 



(a, 4>j) = <*j, l<j<f, 
(d, 4>j) = otj. 



|£(d-a) 2 || 2 > L (&/-«/)', 



so that the mean -squared error, 

e 2 ==-£ £ (aj-aj)<-E\\ c J)(d-a)p. (68) 

v j=\ v 

Ineq. (68) relates the error ef to the error which the system makes in 
transmitting the sequence a(-). 

We proceed to overbound {llv)E\\ c A{d - a)|| 2 . We now define 
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H T (f) = C T (f)B T (f) = | C T (f) J^ G (f - |) H C (/ - 1). 

(69) 
Br(/) is defined by (42b). It is easy to verify that H T (f) is the (sequence) 
transfer function of the overall system from the input to box B at the 
transmitter (Fig. 2), through the channel [defined by (33)], and through 
the sampler and box C at the receiver (Fig. 3). Thus with /i(-), the inverse 
transform olH T (f), 



a(n) = Y. h(n - m)a(m), -co< n <a>. 

m= — •*> 

Further, the error sequence 
d(n)-a(n)= Y a(m)[h(n - m) - 6 n , m ] 



(70) 



= Y a(m)u(n - m), -oo< n <a> ) (7i a ) 

m= — <» 

where the sequence u(-) is defined by 

u(n) = h{n)- 5 n>0 , - °° < n < °°. (71b) 

The transform of the sequence u(-) is 

U T (f) = H T (f) - 1. (72) 

Now, with a k (•) as defined by (31a), we define the convolution of a k (•) 
and u(-) to be 

u k (n) = (a k *u)(n) = Y a k (m)u(n - m) 

m= — <= 
N(k+l) 

- Y a k {m)u{n - m). 

m=Nk+l 

Sincea(-)= £ a*(-), (71) is 

k=-*> 



a - a = a*u = Y Vk- 



(73) 



We next introduce the time-truncation operators D (k \ — °° <k < <», 
defined by 

b (n), Nk + l<n<N(k + 1), 
0, otherwise. 

Of course, £ = £(°>. Then, from (73), 



(Z)Wb )(n) = 



(74) 



-||#(d-a)|| 2 = -||£<°>(d-a)|| 2 
v v 



Y v {0) v k 



fe=- 



= - £ (2 ( \2<\). (75) 

v k,k'=—» 
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Now let us observe that u k depends (through a k ) on the data symbols 
in and only in the fcth data block 

Since all the data symbols are assumed to be statistically independent, 
we conclude that Vk and u k ' (k ^ k') are also statistically independent 
and uncorrelated. Thus 

and (68) and (75) are 

<? I <-E\\3)(&-a)\\ 2 = - £ £||D (0) u*ll 2 . (76) 

We now make another observation about the sequence Uk(-)- As we 
observed in the proof of Theorem 4, the N random variables {a*(n)}Ji. ( fcV+i 
have the same statistics as the N random variables |a (n)}£=i. It follows 
that, for —<*> < n < », Uk(n) has the same statistics as vo(n — Nk), so 
that 

E\\£«»v k \\ 2 = E L u 2 k (n) 

n=\ 

= E £ vl{n-Nk) = E Z oftn) 

n=l n—Nk+1 

= E\\Z)(-Vv Q \\ 2 . (77) 

Substituting (77) into (76), we have 

e?<- £ £||£<-*WI 2 = -E\\u Q \\ 2 . (77a) 

vk — - v 

Now from the convolution formula (13) and the Parseval rela- 
tion (15), 

E||i;oll 2 = -Ellao*"ll 2 
= TE f \A T {f)\ 2 \U T (f)\ 2 df. (78) 

J-1/2T 

Since 

A T (f) = t <*j*j T (f) 

and the {ay}; are uncorrelated, 

£|A T (/)| 2 = £ ^|#/t(/)I 2 - (79) 

7 = 1 

Substituting (79) and (78) into (77), we have 

4 S -T f 1/2T |t/ T (/)| 2 £ a 2 |*y T (/)| 2 rf/ 

j; J -MIT j=\ 
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<^TN 



+ ^ 



;=i Jf<\/\<: 
Using (72) and (69), which define U(f), and the definition of A 3 (42c), 



fN\ 



2 r> 

= -K |£M/)| 2 L I*,t(/)| 2 <*/ 

/' J-F 7 =i 

2 

+ — TL f |lM/)| 2 |*;r(/)| 2 d/ 

V j=i JF<\(\<\/2T J ' 

J]j£M/)| 2 d/ 
^T sup |(/ T (/)| 2 £ f | <*,(/) W 

" F<|/|<1/2T 7=1 */F<|/|<l/2T 

we obtain 

e 2 < «r 2 (y) r J]_* |C T (/)fl r (/) - 1| 2d/ 

Since \\%'4>j || 2 = 1 - Xj, we have established (42), completing the proof 
of Theorem 6. 

APPENDIX A 
Proof of Theorem 3 

Let T, F > 0, with W ± FT < 1/2 be given. Let N = 1, 2, ■ ■ • , also be 
given. Define the sequence y(-) by 

. . sin 2-jrWn 

y(n) = , -co< n <co. (80) 

irn 

The transform of y(-) is easily seen to be 

T T (f)= £ y(n)e-^fTn=\ 1 ' M - F (81) 

0, i^< \f\ < — . 

The bandlimiting operator 'A = % F is therefore defined by b = 33a, where 
B T (f) = T T {f)A T (f). 

Let K" be the NXiV matrix with (m, n)th entry 7(71 - m), 1 <n, m < 
N. Consider the matrix eigenvalue equation 

Ka = Xa, (82) 

where a = (a l9 a 2 , • ■ • , a N )'. Equation (82) is equivalent to 

N 

£ 7(n - m)« m = Xa n> 1 < n < N. (82') 

m=l 

Since K is a symmetric matrix, eq. (82) or (82') has N (not necessarily 
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distinct) real eigenvalues Xi > X 2 > • • • ^ Xyv and a corresponding set of 
N real orthogonal eigenvectors ctj = (a/i, • • • , a,w)^lj< ;' < N. We as- 
sume that the eigenvectors are normalized so that a)ay = bjy. Also we 
can write 

$T y(n - m)a jm = X,a, n , 1 < n < N. (83) 

m=l 

We can now define our sequences )0/(*)l- Let 

*m-\T \t nSN ' < M > 

J 10, otherwise. 

Equation (83) is therefore 

£ y(n - m)4>j{m) = \j<l>j(n), 1 < n < N, (85) 

n=l 

which is equivalent to 

£%<t>j=\j<t>j, l<j<N, (86) 

where 3) = ®n, and 3 = !B F . This is Theorem 3B. 

Now, for 1 < ;' < N, let c ; (-) = £0;. Then (86) implies that 

Cj(n) = \j<t>j{n), \<n<N. (87) 

Thus 

N N 

{Cj, 0>> = L c ; -(n)tfy(n) = L \j(t>j(n) 

n=l n=l 

= X ; ||^|| 2 =Xy. 

Further, since the transform of c ; -(-) is C, T (/) = T T (f)Qj T (f), the Parseval 
relation (14) yields 

X1/2T 
■1/2T 

= T C F p \*jV)\ 2 df. (88) 

From (88), \j < UjW 2 = 1 and X, > 0. In fact, if X, = 0, then *;(/) = 
for |/| < F. But, since *,(/) is a polynomial in e~ i2 ^ T , it vanishes on an 
interval only if it vanishes identically, which contradicts ||</>; || = 1. Thus 
\j > for 1 < ;' < N. Since 2f X y = trace K = 2WN, the (\/}f can be la- 
beled so that they satisfy Theorem 3A. 
Now Theorem 3C follows from 

Theorem 3D is established as follows: 

J1/2T 
Cj T (f)C' kT (f)df 
-1/2 T 
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/»1/2T 

= T I r T (f)*jT(f)*kT<f)df 

J-1/2T 

(1) 

(2) 
- \(<t>j, <t>k) - X/Sy* • 

Step (1) follows from the fact that <pk has support on [1, TV] so that for 
any a(-), {a, (f>k > = (2)a, 4>k). Step (2) follows from Theorem 3B. 
To prove Theorem 3F, observe* that 

£ X/ = trace K = 2WN (89) 

7=1 

and 

£ X? = trace (K*K) = £ iHn - m). 

y=l n,m=l 

Substitution of the formula for y(n) (80) and a simple computation 
yields 

Z\f>2WN-0(\ogN), (90) 

i 

as 7V —*■ 0°. Combining (89) and (90), we have 

1 N 1 N « (loe AT) " 

^EMl-^-iE^-x^^f^— o. (9i) 

Let S = i/: 5 < X ; < 1 - 5}. Then (91) yields 

1 i JV N 

- 6 2 (card S)<-T A,(l - \j) -+ 0, 

iV JV i 

which is, on dividing by 5 2 , Theorem 3E. 

Theorem 3F follows directly from Theorems 3A and 3F. Theorem 3G 
is established in Ref . 4. Finally, Theorem 3H follows immediately from 
the definition of the <t>j{-), X,-, 1 < j < N. 



APPENDIX B 

Saltzberg's Bound 
Saltzberg's bound 2,3 states that if £ is a random variable defined by 

£i ■ E Vjocj, 

where jay}", are i.i.d. copies of the r.v. a, defined by (29), and jiy} are 
fixed coefficients, then the moment generating function of £i 



M^(s) =Ee s ^ > exp 



(s 2 rt 



, s > 0, (92a) 



* This trick is used in Ref. 7. 

PAM DATA TRANSMISSION 3305 



where 



* 



(92b) 



The right member of (92a) is the moment-generating function of a 
Gaussian r.v. with zero mean and variance a\. 

Now let £ = £1 + £ 2 , where £1 is as above and £ 2 is a Gaussian r.v. with 
zero mean and variance o%. Let £1 and £ 2 be statistically independent. 
Then, from Saltzberg's bound (92), the moment-generating function of 

Mi(s) = M^s) • M^s) < exp j|- (af + of)}, s > 0. 
It follows from the Chernoff bounding technique that, for r > 
Pr(£ > rj < exp 



JM+.W <93) 

Let us now apply (93) to establish the claims made in remarks in and 
V in Section V. In remark iv, observe that x(t) is a random variable of 
the form of £1, i.e., a linear combination of the data symbols {ay}. If we 
apply (93) with £1 = jc(t), £ 2 = 0, we obtain the inequality of remark 
iv. 

Next consider Remark v. Observe that, due to the linearity of the 
system, the error ay - ay is of the form of £, with a\ + a\ = t). Thus (93) 
yields 

Pr{(ay - ay) > 11 < exp 



*9 



Since &j ^ ay, only when |ay - ay | > 1, we have 



P ej = Pr{ay ^ ay} < 2 exp 



-1 



2<? 



APPENDIX C 

Theorem 7: Let |0y (-)lyt 1 be an orthonormal set of sequences (in l 2 ) with 
support on [1, N]. That is, D N <t>j = 0y and (fy, 4> k > = fyfc, ■? ^ /> ^ #• 
Let $jt(D, _0 ° </ < co ,bethe Fourier transform of <£,(•), 1 < j < N. 
Then 



N 



-00 </< 



£ |<*>yr(/)| 2 = N, 

Proof: Let u(n) = e~ i2 *f Tn or according as n e [1, N] or n $ [1, N]. 
From the orthonormality of the [+jff, we conclude that they span the 
N-dimensional space of complex-valued sequences with support on 
[1, N]. Thus we can write 



N 



v(n) - L Uj4>j(n), 
y=i 



— 00 < n < °°, 
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where 

N 

uj= (u, <t>j) = Y. v(n)4>j(n) 

= £ e- i2 *f Tn 4>j(n) = $ jT (f), 1 < ; < N. 

n=— en 

Furthermore, the orthonormality of the |</> ; ) also implies 

£ IwM 1 - £ Wj\*-£ \* JT (f)\ 2 . 

Since |u(n)| = 1, we have established the theorem. 
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